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DETECTING  THE  SHIFT  IN  THE  PROBABILITY  OF 
SUCCESS  IN  A SERIES  OF  BERNOULLI  TRIALS 


S . Zacks 
Z.  Barzily 


I . Introduction  and  Summary 


This  paper  studies  the  problem  of  controlling  the  success  prob- 
ability of  a sequence  of  independent  Bernoulli  trials.  More  specif- 
ically, a sequence  of  Independent  Bernoulli  trials  starts  with  a success 
probability  0 , 0 < 0 < 1 , and  at  an  unknown  epoch  the  success  prob- 
ability shifts  to  (j)  greater  than  6 . The  present  study  is  devoted  to 
the  development  of  a stopping  rule  when  both  9 and  (()  are  unknown. 

We  study  the  problem  in  a Bayesian  framework  and  show  the  nature  of  the 
optimal  dynamic  programming  solution  when  one  is  penalized  for  early  or 
late  stopping.  The  nature  of  the  Bayesian  optimal  stopping  rule  when 
6 and  4i  are  known  was  established  previously  by  Sirjaev  [2]. 

When  the  success  probabilities  are  unknown  the  optimal  solution  is  consid- 
erably more  complicated.  This  paper  shows  how  approximate  solutions  can 
be  obtained  and  applied  effectively.  A series  of  numerical  illustrations 
shows  the  effectiveness  of  the  proposed  procedure  in  some  simulated  cases. 


The  problem  studied  here  was  motivated  by  a problem  of  determining 
the  epoch  of  change  in  the  readiness  of  systems.  The  results  of  this 
study  can  be  applied  to  a variety  of  other  problems  of  applied  Interest. 
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The  paper  Is  comprised  of  six  sections.  The  formulation  of  the  Bayesian 
framework  and  the  likelihood  functions  is  carried  out  in  Section  2. 

Section  3 provides  the  dynamic  programming  formulation  of  the  optimal 
stopping  rule.  The  case  of  known  success  probabilities  is  discussed  in 
Section  4.  Section  5 discusses  the  convergence  of  the  algorithm  when  the 
success  probabilities  are  unknown,  and  the  results  of  some  simulations 
are  given  in  Section  6.  These  simulations  strongly  indicate  that  the 
proposed  detection  procedure  is  quite  effective.  It  is  generally  very 
difficult  to  detect  shifts  on  the  basis  of  small  sequences  of  0-1 
Bernoulli  trials. 

2.  Likelihood  Functions,  Prior  and  Posterior 
Distributions 

Let  Xj^,X2,...  be  a sequence  of  independent  Bernoulli  random 
variables,  i.e.,  each  can  assume  the  values  0 or  1 , and 

PfX^=l]  = C'^  , i=l,2,...  , where  6^  is  the  probability  of  "success." 

Tliis  paper  considers  the  problem  of  one  shift  in  the  values  of  6^  . 

More  specifically,  let  x = 0,1,2,...  and 

0 , if  i £ T 

0.  = , (2.1) 

({)  , if  1 > T 

where  0 < 6 < (J)  < 1 . Thus  x is  the  epoch  of  shift  from  0 to  • 

This  epoch  of  shift  is  unknown. 

The  approach  here  is  Bayesian,  and  accordingly,  the  parameter 
values  (x,0,4))  are  considered  as  random  variables.  Moreover,  it  is 
assumed  that  x is  priorly  independent  of  (0,<|))  , having  a prior  p.d.f. 
i^(x)  concentrated  on  the  nonnegative  Integers.  The  parameters  (0,ij)) 
have  a prior  distribution  over  the  simplex  0 < 0 £ < 1 , with  a prior 

p.d.f.  h(0,(j))  . In  the  present  paper  we  focus  our  attention  on  a geo- 
metric prior  distribution  for  x of  the  form 
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(]-irQ)p(l-p) 


if  T = 0 


if  T > 1 


(2.2) 


where  0 < tt  , p < 1 . This  prior  distribution  was  introduced  by  Sirjaev 

in  [2].  The  likelihood  function  of  (t,6,4>)  , given  observations  on 

X = (x, , . . . ,x  ) , is 
~n  1 n 

n-1  T.  j-T.  n-j-T^”^ 

L(T,e,(});  X ) = I l{T=j}  e J (1-6)  (j)  J (1-(|))  ""J 

j=0 


(2.3) 


T n-T 

+ 1{t  > n}  0 " (1-9)  " , 


where  T = J x.  , j=l,...,n  , = T - T.  and  T.  = 0 . This  like- 

.1  1 n-j  n j 0 

lihood  function  provides  the  information  in  the  sample  on  the  parameters 

(T,0,(())  . If  6 and  (J)  are  known  we  consider  the  likelihood  as  a function 

of  T . The  posterior  p.d.f.  of  (T,0,<t)  , given  x , can  be  obtained  from 

~n 

the  likelihood  function  (2.3)  and  Bayes'  formula.  This  posterior  p.d.f.  is 


(t,9,(|);  X ) = ip(T)  h(e,<t>)  L(r,e,(p;  x ) /d  (x  ) 
~n  ~n  / n ~n 


(2. A) 


where 


D = I <PU)  f f L(j,6,(J);  X ) h(0,(|.)  d(})d9  . 

j=0  0 0 


(2.5) 


Other  functions  of  interest  are  the  posterior  probability  of  {t<n}  given 

X , and  the  posterior  probability  of  future  success  given  x , i.e., 

»n  »,n 

P{x^^j=l  I x^}  , j=l,2,...  . The  posterior  probability  of  {T<n}  given 
x^  is  used  to  decide  whether  to  stop  the  process.  We  denote  it  by 
Ti^(x^)  and  compute  it  according  to  the  formula 


tp*  1 1 

= 1 - X / f L(n,e,<t>;  X ) h(0,4>)  d(J)d0  , 


(2.6) 
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We  will  show  later  that  under  general  conditions  on  h(9,<J))  the 

posterior  probability  ir  (x  ) converges  to  1 a.s.,  as  n ^ . Given 

n ~n 

, the  future  probability  of  success  is,  for  all  j=l,2,... 

p[x^_^.j=l  I x^j  = e{0  I{t  > n+j}  I + e{4>  I{t  < n+j-1}  | x^}  . 

(2.7) 

This  probability  can  be  expressed  in  terms  similar  to  those  of  (2.6). 
Explicit  expressions  will  be  considered  later. 


3.  Optimal  Stopping  Times 

The  problem  of  detecting  the  shift  point  T can  be  approached 
from  a decision  theoretic  point  of  view  in  the  following  terms.  After 
each  observation  we  have  the  option  to  stop  sampling  and  declare  that  the 
shift  has  already  occurred.  The  process  is  then  Inspected,  and  if  the 
shift  has  not  yet  occurred  a penalty  of  $C2  is  imposed  for  undue  stopping. 

On  the  other  hand,  if  we  do  not  stop  and  the  shift  has  already  occurred, 
we  are  penalized  $0^^  for  each  additional  observation.  We  develop  here 

a characterization  of  the  optimal  stopping  rule,  using  the  methods  of 
dynamic  programming.  We  start  with  a truncated  version  and  impose  the 
restriction  that  stopping  must  occur  after  a finite  number  of  observations. 


Let  R^^^(x  ) denote  the  minimal  posterior  risk  after  n observa- 
n -n 

tlons,  given  x^  , when  at  most  j more  observations  are  allowed.  For 
j=0  we  have 


R^^^x  ) = C_(l  - TT  (x  ))  . 
n ~n  2V  n ~n  / 


(3.1) 


If  one  more  observation  is  allowed,  i.e.,  j=l  , the  posterior  risk  asso- 
ciated with  observing  x is 

n+i 

r'S^^(x„)  = C^TT  (x  ) + C„(l  - E{7T  ,,  (x  .,)  1 X })  . (3.2) 

n -n  1 n 2V  n+1  -n+1  ' ' 
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- 4 - 
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, since  ’^n+l^*n+l^  ~ | } we  obtain 


Moreover, 


the  ]:^w  of  iterated  expectation  that 


^IVl^^n+1^  ' ^nl  = Ejl{T<n}  | xj  - tt^(x^)  + (l  - tt^(x^))p  • 0.3) 


Using  (3.2)  and  (3.3)  we  obtain 


R^^^x  ) = min(R^°^(x  ) , R^^^x  )| 
n ~n  ( n ~n  n ~n  ) 

= C„fl  - IT  (x  ))  + TOinfo,  C, TT  (x  ) - C„p(l  - TT  (x  )))  . 
2\  n ~n  / y 1 n ~n  2*^  V n ~n  /y 


(3.4) 


Obviously,  R^^^(x  ) < R^^^x  ) for  all  x . It  is  optimal  to  stop 
n ~n  --  n ~n  ~n 

after  n observations,  when  j=l  , if  and  only  if 


SP 

\^^n^  “ ^2P 


(3.5) 


If  j=2  , then  according  to  the  optimality  principle  of  dynamic 
programming  it  is  optimal  to  stop  after  n observations  if  and  only  if 

R^*^\x  ) < C IT  (x  ) + EjF^^^(x  . 1 ) I X I . Moreover, 
n ~n  — 1 n ~n  ( n+1  ~n+l  ~n) 


^{•^n+l^^n+l^  I ^nj  S ^n+l^?n+l^  ^ ~nl 

+ Ejmin^O,  Vl<^n+1^  ’ P^(^  ~ Vl^^n+lO)  > ^n) 

= r fl  - 7T  (x  ))(l-p)  + M^^^x  ) , (3.6] 

\ n ~n  / n ~n 


where 


= Ejmin^O,  ~ ~ Vl^^n+lO)  > ^n  j ' 


(3.7) 


The  minimal  posterior  risk  is  then 


R^^^x  ) = C (l  - >r  (X  )) 

n ~n  2\  n ~n  / 

+ min/o,  C IT  (x  ) - C p(l  - it  (x  ))  + M^^\x  ))  . (3. 

y 1 n ~n  2 \ n ~n  / n ~n  / 
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. J 


Notice  that  ) < 0 for  all  x and  therefore  R^^^(x  ) < 

n ~n  - n ~n  - 


,(U 


‘ furthermore,  for  j=2  it  is  optimal  to  stop  after  n 


observations  if 


C„p  - ) 

7T  (x  ) > --= 2 

n ~n  - + C^p 


(3.10) 


The  stopping  boundary  for  J=2  fthe  RHS  of  (3.10)]  is  not  smaller  than 

that  for  j=l  [the  RHS  of  (3.5)].  Thus,  if  rr  (x  ) < C^p/(C,  + C,p) 

n ~n  2 1 2 

it  is  optimal  to  continue  and  take  at  least  one  more  observation. 


In  a similar  fashion  we  obtain  by  backward  induction  that,  for 
all  j > 2 , 


R^J\x  ) = C (1  - TT  (x  )) 
n ~n  2'  n ~n  ' 


+ ™l„(o.  - CjpCi  - + H«-U(x^))  , 


(3.1] 


where,  for  each  i 2 , 


M^^^(x  ) 
n -n 


( / 


Lemma  1 : 


M^^\x  ) < ^\x  ) 

n ~n  - n ~n 


(3.13 


with  probability  one  for  all  i=l,2,...  and  all  n=l,2,...  , where 


M^”\x  ) E 0 . 
n 


Proof:  The  proof  is  by  Induction  on  i . Since  M^^^(x  ) < 0 

n ~n  — 


for  all  X and  all  n , we  obtain  that 
~n 


(3.14 
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for  all  X ..  . Hence,  the  conditional  expectations  of  the  two  sides 
~n+l 

(2) 

of  (3.14),  given  x , preserve  the  inequality.  That  is,  M (x  ) < 
~n  n ~n  ~ 

M^^^(x  ) , for  all  n=l,2,...  and  all  x . If  we  assume  that 
n ~n  ~n 

) for  all  k=l,...,i  , all  n=l,2,...  , and  all 

n ~n  ~ n -n 

X , we  obtain  that 
~n 


min(0,  - C^pCi  - + M^iJ^^n+lO 

< min(o,  - C2P(1  - Vl^^n+l^)  ^5n+l^) 


(3.15) 


The  conditional  expectation  of  the  LHS  of  (3. ‘5),  given  x , is 

~n 

m(1'*'1)(x  ) and  that  of  the  RHS  is  M^^^(x  ) . Thus  we  have  established 
n -n  n ~n 

that  ) < M^^\x  ) for  all  1=1,2,...  , all  n=l,2,...  , and 

n ~n  — n ~n 


all  X 
~n 


Q.E.D. 


Define  the  boundary  values 

/j-1) 


b^^^x  ) = min 
n ~n 


c^p  - m; 


(^n> 


, 1 


.1=1,2 


(3.16) 


When  there  are  only  j optional  observations  it  is  optimal  to  stop 

sampling  if  TT  (x  ) > b^^^x  ) . Notice  that  the  inequality  M^^^(x  ) < 
n -s.n  — ^ - ” " - — 


n -n 


n ~n 


^\x  ) implies  that  b^^^(x  ) > b^^  ^\x  ) for  all  j=l,2,...  with 
n ~n  n ~n  — n ~n  j , , 

probability  one.  Since  the  sequences  b^^\x  ) are  bounded  from  above 

n ~n 

(1) 

by  1 , 11m  b •’  (x  ) exists  for  each  n and  each  x . Denote  these 
. n ~n  ~n 

3-^ 

limits  by  • 31ie  stopping  rule  which  calls  for  stopping  if 

Ti  (x  ) > b (x  ) is  an  optimal  solution  of  the  untruncated  problem.  This 
n ~n  — n ~n 

is  due  to  the  fact  that  the  risk  is  bounded  below  by  zero  (see  [1]).  We 
notice  In  (3.16)  that  for  all  n and  all  x^  , b^(x^)  > C2P/(Cj^  + C2P) 
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Hence,  if  ir  (x  ) < C»p/(C,  + C„p)  it  is  optimal  to  continue.  The 
n ~n  I i.  1 

question  is  how  to  determine  b (x  ) when  x is  such  that  7r  (x  ) > 

n ~n  -n  n -n  — 

C2P/(C^  + C^p)  . 


4.  The  Case  of  Known  Success  Probabilities 

Consider  the  case  of  known  success  probabilities  0 and  ()>  . 

This  is  a special  case  of  the  general  Bayesian  model,  in  which  the  prior 
distribution  of  (0,4>)  is  concentrated  on  a specific  point  (0q,<|)q)  • 

This  special  case  can  be  applied  to  practical  control  problems  in  which  a 
rectification  of  a process  is  needed  whenever  <t>  ^ <Pq  the  Interval 

is  a "region  of  indifference."  Given  x , the  posterior  prob- 
u u ~n 

ability  of  {x  < n}  for  this  special  model  is: 

TT  (x  ) 
n ~n 


= 1 - 


(1-TT^)  (1-p) 


”00  ” ^ <-v  0 


^(n) 

*^l\  ” ^ n-j  . \n-l 

— I w + (1-TT  )(l-p) 


(4.1) 


where  = OQ/d-e^)  , and  o)  = (1-4>q) / (1-0^)  . 

Let  IT  (x)=l-n(x).  We  can  easily  establish  the  recursive 
n ~n  n ~n 


relationship : 


"n+l^^n+1^  = 


T„(X  ) (1-p) 
n 


Pi  x’^n+l 


0)  + IT  (x  ) (1-p) 
n ~n 


(4.2) 


This  recursive  relationship  shows  that  when  0 and  (Ji  are  known,  the 

{tt  (x  );  n > 1}  process  is  Markovian.  In  order  to  determine  the  value 
n ~n  — 

of  tt  , 1 (x  . , ) it  is  sufficient  to  know  the  value  of  tt  (x  ) and  the 
n"ri  -.n+i  n -n 
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value  of  X . Let  ir  (x  ) = it  and  X ,,  = Y , and  let 
n+1  n ~n  n+1 


iji(Tr.Y)  = 


+ (l-ir)p) 


Z w(7T  + (l-ir)p)  + (l-TT)(l-p) 


(4.3) 


where  Z = p,/p„  and  i|i(Tr,Y)  is  IT  . , (x  ,Y)  as  a function  of  tt  and  Y . 
i u n+i  ~n 

Thus,  the  sequence  n l}  is  a stationary  Markov  sequence  in 

the  sense  that,  given  it  (x  ) = tt  the  distribution  tt  (x  ,,  ) for  all 

n ~n  n+k  ~n+k 

k > 1 is  independent  of  n . Furthermore,  one  can  readily  prove  that 

{ti  (x  );  n > 1}  is  a submartingale  with  respect  to  the  Bayes  predictive 
n ~n  — 

distributions  of  x (n  > 1)  [see  the  denominator  on  the  RHS  of  (4.1)]. 

~n  ~ 

Thus,  with  respect  to  these  Bayes  predictive  distributions,  Tf  (x  ) -*■  1 

n ~n 

a.s.  (see  Sirjaev  [2,  p.  153]).  We  provide  in  the  following  lemma  a proof 
for  our  specific  problems  which  establishes  the  convergence  of 


one  whenever  a shift  occurs  at  a fixed  finite  time  point  t * k . The 
convergence  established  in  Lemma  2 is  with  respect  to  a sequence  of  dis- 
tributions with  fixed  parameters,  while  Sirjaev' s result  is  the  convergence 
a.s.  with  respect  to  the  prior  mixtures  of  such  distributions. 

Lemma  2:  When  6^  and  are  known,  and  if  T = k for  some 

k < <»  , then  tt  (x  ) ->  1 a.s.  as  n . 
n ~n 


Proof:  Let 


c - n ~n 

n 1 - TT  (x  ) • 
n —n 

It  is  sufficient  to  show  that  S ->  ® a.s.  [(|)„]  . According  to  (4.1), 

n U 


T n-1.  ,n-j  T 

M I, \ \ -J 


II  , V ^ L > . I 


(n) 

n-j 


(4.4) 


Obviously, 


S „ . (S  +p)Z  ^ ^ s z -2- 

n+1  n*^  1-p  - n 


9 - 


1-p 
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hence 


,(n+k) 


S .K  > S,  Z 
n+k  — k 


^.(n+k)  ^ 


,(n+k) 


(n+k) 


(1-p)"  n-T^ 

®o"  " 


T n-T 

The  function  w *'(l-o))  ” is  maximized  by  u)  = T^/n  . Hence 


T.(n+k) 

^^(n+k)^  n ^ ^(n+k)^  n 


,(n+k) 


,(n+k) 


1 1 . 


(l-6o) 


for  all  6 in  (0,1)  . Finally,  since  ■*■  <j>-  a.s.,  one  obtains 

n u 


(4.6) 


^(n+k)  ^_^(n+k) 

\ C " 

lim  S > lim  lim  — 7-^-r y - 

(1-p)"  n-> 

9o"  (i-e^)  " 


(4.7) 


,(n+k) 


= lim 


^^(n+k)^  n ^ ^(n+k)^  n 


,(n+k) 


n-K»  (1-p)  n-*«> 


,(n+k) 


,(n+k) 


(I-Oq) 


, a.s.  [ 0Q ] . 


Q.E.D. 


The  boundary  function  for  the  optimal  stopping  rule  depends  on  the 
value  of  TT  and  does  not  depend  on  n . Let  B(7t)  , 0 £ tt  £ 1 , denote 
the  boundary  function.  The  stopping  rule  requires  that  sampling  be  ter- 
minated as  soon  as  ^n^*n^  — proven  earlier,  B(tt)  £ it*  , 
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where  it*  = C„p/(C  + C p)  . Let  M (ir)  denote  the  function  ^ (x  ) , 
i X z n ~n 

1=1,2,...  and  let  M(tt)  denote  the  function  M (x  ) when  tt  (x  ) = ir  . 

n ~n  n ~n 

These  functions  do  not  depend  on  n . The  boundary  B(ti)  Is  defined  as 


B(tt)  - min 


/ C p - M(TT)\ 


< 1 , 


(4.8) 


.Similarly,  for  each  1=1,2,...  define 


(1)  ( 

« '(-)  = «nin^l  . ^ 


(4.9) 


The  sequence  {B^^^(tt);  1=1,2,...}  converges  monotonically  to 

B(tt)  for  each  tt  . Consider  the  functions  M^^^(ir)  and  B^^^(tt)  . 
According  to  (3.7), 

M^^\tt)  = E^jTnln(o  , Cj^i|t(tt,X)  - C^pCi  - iKTT,X)))j  , (4.10) 

where  tlie  distribution  of  X has  the  probability  function 

p(x;  TT)  = [TT  + (1-TT)p]({)q  (l-<j)Q)^"''  + (l-TT)^^“P^e^  (I-Oq)^"’'  . 

(4.11) 

We  can  easily  verify  that  for  each  value  of  X , t|t(tt,X)  is  a strictly 
increasing  function  of  it  and  that  tj;(Tr,l)  > tt  for  every  tt  in  (0,1)  . 
Thus,  if  TT  ^ TT*  , then  ijj(TT,l)  > tt*  , and  therefore  Cj^t|;(tt,1)  - 

C2P(1  - tj»(TT,l))  > 0 . Hence,  for  TT  tt*  , 

M^^^tt)  = P^{x=0}(c^iJ>(TT,0)  - C2P(1  - <j^(TT,0)))‘  , (4.12) 

where  a = mln(0,a)  ; and  P^{x=0}  = tt(1-(J)q)  + (1-tt)(1-6q)  . Substituting 

the  formula  for  t|;(TT,0)  into  (4.6)  yields 

n'T  , [C  u)(tt+p(1-tt))  - C-p(l-TT)(l-p)l” 

h'  ^ (i-.Hi-p) — • 

(4.13) 


I 
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Notice  that  M'  (it)  Is  a continuous  function  of  tt  and  that 
-►  0 as  Ti  > 1 . Correspondingly,  is  a continuous 

function  of  it  and  -*•  ti*  as  tt  ->■  I . 

According  to  the  recursive  equation  (3.12), 

M^^\tt)  = E^j(Cjip(Tl,X)  - C2p(l-i/^(7T,X))  + M^^^(i|^(TT,X)))~j  . (4.14) 

(2) 

Thus,  M (tt)  is  a continuous  function  of  tt  . In  a close  neighborhood 
of  1 , M^^^(tP(tt,X))  = 0 for  x=0,l  and  therefore  M^^^(ir)  = M^^\tt)  . 

By  induction  of  i we  show  that  M^^\ir)  is  a continuous  function  of 
TT  for  all  1=1,2,...  and  that  M^^^(tt)  ->-0  as  tt  ->■  1 . Correspond- 
ingly, all  the  boundary  functions  B^^\tt)  are  continuous  and  converge 
to  TT*  as  tt  -»•  1 . Thus  we  can  show  that  B(tt)  is  continuous  and 
converges  to  tt*  as  tt  ->  1 . Therefore,  there  exists  a value  Tt^  such 

that  TT.  = B(tt,  ) and  for  the  first  n at  which  tt  (x  ) > tt,  it  is  op- 
i n ~n  — 1 

tiTnal  to  stop.  Sirjaev  [2,  pp.  149-155]  proved  this  result  in  a more 

general  context;  however,  he  has  not  determined  the  value  of  tt^  . We 

have  shown  that  in  our  framework,  ^ tt*  . 


5.  Unknown  Success  Probabilities  with  Uniform  Prior 

In  this  section  we  investigate  the  nature  of  the  decision  process 
when  the  prior  distribution  of  the  unknown  (6,(|>)  is  uniform  over  the 
simplex  0 < 0 ^ (J>  < 1 , l.e.,  the  prior  p.d.f.  is 


h(0,(|))  = 2I{0  < 0 < 4)  < 1} 


(5.1) 


The  posterior  p.d.f.  of  (0,4>)  is 


I >l^(j)L(j  ,0,4);  X ) 


g(0,4>  I x^)  = I{0  < 0 < 4)  < 1} 


n 


(5.2) 
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Here 


“>  T n-T 

I X ) = TT  (j)  " (l-cfi)  " 

j=0 


n-1 


j-T,  T 


(n) 


+ (l-TT)p  I (1-p)^  ^ 0 j (1-0)  ^ J (1-4)) 

j=l 


+ (l-7r)(l-p)"~^  0 " (1-0) 


(5.3) 


and  the  function  obtained  by  integrating  (5.3)  over  the  range 

0 ^ 0 £ 4>  ^ 1 . Accordingly,  we  obtain  after  some  algebraic  manipulations 


D (x  ) = TT  B(T  +2  , n-T  +1) 
n ~n  ' n ’ n 


,(n) 


n-1  , , B(t'-  '+1  , n-j-T'  '+!)  n-j  B(T  -i+1  , n-T  +i+2) 

j=l  ^ 1=0  B(T^  ^-i+1  , n-j -T^  ^+1+2) 

n-j  ’ n-j 


+ (l-ir)(l-p)"“^  B(T  +1  , n-T  +2)  , 
n n 


(5.4) 


where  B(Vj^,V2)  = r(Vj^)r(V2)/r(v^+V2)  is  the  beta  function.  The  posterior 

probability  tt  (x  ) can  be  determined  by  the  formula 
n ~n 

TT  (x  ) = 1 - (1-TT)(1-P)"~^  B(T  +1  , n-T  +2)/D  (x  ) . (5.5) 

n -n  n n n 

If  we  denote  by  Y the  result  of  the  (n+l)st  trial,  and  if  ir  (x  ) = 

n ~n 

1 - IT  (x  ) , then  we  obtain  from  (5.5)  the  expression 
n 


(1-p)  TT  (x  ) D (x  ) B(T  +14-Y  , n+3-T  -Y) 
n ~n  n ~n n n 

B(T  +1  , n-T  +2) 
n+i  ~n  n n 


(5.6) 


More  specifically. 


(l-p)TT  (x^) 


T +1  D (x  ) 
n . n ~n 


n ~n  n+3  D , (x  ,1) 
n+i  ~n 


(5.7) 


and 
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n+2-T 


n 


(5.8) 


From  the  basic  definitions  we  can  establish  that 


P[x  ,1=1  I X ] = _ / \ 

n+1  ' -n  D (x  ) 


n ~n 


(5.9) 


According  to  (3.3),  the  process  {tt  (x  );  n > 1}  constitutes  a 

n ~n  — 

submartlngale  with  respect  to  the  Bayes  predictive  distributions  of 

Thus,  11m  ir  (x  ) exists  with  probability  one,  and  as  in  Sirjaev  [2, 
n-w 

p.  155]  we  can  show  that  llm  tt  (x  ) • 1 a.s.,  with  respect  to  the 


X 

~n 


Bayes  predictive  distributions.  In  the  following  lemma  we  prove  this 
convergence  for  cases  of  fixed  (6q,(J)q)  and  x = k (finite). 

Lemma  3:  If  x = k for  some  k < “>  then  n (x  ) -►  1 a.s.  as 
n ~n 

n ->  oo  . 


Proof;  We  write 

1 1 

TT  (x  ) = / / P[x<n-1  I X ,0,(|)1  g(6,(J)  I X )d(J)d0  . (5.10) 

" 0=0  4)=0 

We  see  in  (4.7)  that  as  n •>  “>  then  P[x<n-1  | x^,0,(t)]  ->■  1 a.s.,  uni- 
formly in  (0,(j>)  . Accordingly,  there  exists  N(6)  such  that  for  all 
n > N(6)  , P[x<n-1  | x^,0,<|)]  ^ 1-6  . Thus,  from  (5.10), 


i 1 

TT  (x  ) ^ (1-5)  / / g(0,4>  I X )d<])d0  = 1 - 6 . 

n -mIi  ^ 


(5.11) 

Q.E.D. 


Note  that  the  above  proof  does  not  depend  on  the  assumption  of  the  uni- 
form prior  distribution  of  (0,ij))  . 


III  « 1 
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6.  Some  Numerical  Examples 

In  this  section  we  provide  several  numerical  Illustrations  of  the 
stopping  rule: 

N = least  n > 1 such  that  tt  (x  ) > ) , (6.1) 

~ n ~n  — n ~n 

for  the  case  where  9 and  4)  have  a uniform  prior  on  the  simplex 
0 < 6 £ 4>  < 1 , and  the  shift  parameter  t has  a geometric  prior  dis- 
tribution. Each  illustration  is  based  on  an  independent  simulation  of  X 
values,  in  which  is  a Bernoulli  random  variable  with  a specified 

parameter  G if  n £ x and  with  parameter  (()  otherwise.  The  simulation 
in  a given  run  is  continued  until  the  decision  rule  calls  for  stopping. 

One  hundred  independent  replicas  were  run  in  each  case,  and  the  empirical 
frequency  distribution  of  the  stopping  locations  was  recorded.  In  Table 
) we  present  these  frequency  distributions  for  cases  in  which  X = 10  , 

It  = 0.01  , p = 0.01  , 0 = 0.3  , and  X = ~ varied  the 

parameters  4)  over  the  range  .5  to  1.  in  order  to  illustrate  the 
effect  of  4’  oil  the  speed  of  detection.  The  above  parameters  u and 
p were  chosen  small  in  order  to  lessen  their  effect  on  the  stopping 
times.  The  value  of  X was  chosen  sufficiently  small  to  reduce  early 
stopping. 

As  indicated  in  Table  1,  the  distribution  of  stopping  locations 
after  the  shift  has  occurred  tends  to  concentrate  near  the  point  of  shift 
as  4>  increases.  This  is  expected,  since  large  values  of  4>  frequently 
yield  the  value  x = 1 . On  the  whole,  it  seems  that  the  stopping  rule 
(6.1)  is  sensitive  and  its  performance  can  be  controlled  by  varying  the 
parameters  tt  , p , and  X . The  number  of  replicas  on  which  Table  1 
is  based  is  too  small  for  definitive  comparisons  of  the  stopping  time 
distributions.  To  establish  these  distributions  more  accurately,  either 
extensive  simulations  or  a different  numerical  approach  is  needed. 
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TABLE  I 

EMPIRICAL  FREQUENCY  DISTRIBUTIONS 
OF  STOPPING  RULE  (6.1) 


0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

3 

2 

0 

3 

4 

5 

0 

0 

0 

0 

1 

1 

5 

4 

5 

7 

5 

6 

6 

5 

3 

5 

3 

4 

3 

2 

3 

2 

2 

1 

4 

8 

6 

7 

7 

7 

1 

13 

12 

7 

10 

10 

1 

8 

15 

23 

31 

35 

7 

20 

15 

15 

26 

28 

6 

7 

11 

15 

6 

3 

6 

5 

10 

7 

3 

0 

5 

7 

8 

5 

2 

0 

1 

6 

3 

2 

0 

0 

3 

1 

3 

1 

0 

0 

4 

3 

4 

1 

0 

0 

4 

4 

2 

0 

0 

0 

2 

1 

0 

0 

0 

0 

1 

0 

0 

0 

0 

0 

3 

2 

0 

0 

0 

0 

0 

1 

0 

0 

0 

0 

1 

0 

0 

0 

0 

0 

In  one  case  here  the  decision  rule  did  not 
call  for  stopping  even  after  30  observations. 
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